Feature Allocations, Probability Functions, and Paintboxes
Abstract
The problem of inferring a clustering of a data set has been the subject of much research in Bayesian analysis, and there currently exists a solid mathematical foundation for Bayesian approaches to clustering. In particular, the class of probability distributions over partitions of a data set has been characterized in a number of ways, including via exchangeable partition probability functions (EPPFs) and the Kingman paintbox. Here, we develop a generalization of the clustering problem, called feature allocation, where we allow each data point to belong to an arbitrary, non-negative integer number of groups, now called features or topics. We define and study an “exchangeable feature probability function” (EFPF)—analogous to the EPPF in the clustering setting—for certain types of feature models. Moreover, we introduce a “feature paintbox” characterization—analogous to the Kingman paintbox for clustering—of the class of exchangeable feature models. We provide a further characterization of the subclass of feature allocations that have EFPF representations.
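To make the distinction between a clustering and a feature allocation concrete, here is a minimal sketch. It assumes data points are indexed 0..n-1 and that a feature allocation can be stored as a list of index sets; the helper names `memberships` and `is_partition` are hypothetical and do not come from the paper.

```python
from collections import Counter


def memberships(n, features):
    """Return, for each data point 0..n-1, how many features contain it.

    `features` is a list of sets of indices (an assumed representation).
    """
    counts = Counter()
    for block in features:
        for i in block:
            if not 0 <= i < n:
                raise ValueError(f"index {i} is outside 0..{n - 1}")
            counts[i] += 1
    return [counts[i] for i in range(n)]


def is_partition(n, features):
    """A clustering is the special case in which every block is non-empty
    and every data point belongs to exactly one block."""
    return all(features) and all(c == 1 for c in memberships(n, features))


# A clustering: each of the 5 points sits in exactly one group.
clustering = [{0, 1}, {2}, {3, 4}]

# A feature allocation: point 1 has three features, point 4 has none.
allocation = [{0, 1, 3}, {1, 2}, {1}]

print(memberships(5, clustering))   # per-point membership counts
print(memberships(5, allocation))
print(is_partition(5, clustering), is_partition(5, allocation))
```

The membership counts are what separate the two settings: a partition forces every count to be one, while a feature allocation allows any non-negative integer.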
Related articles
Some New Results on Policy Limit Allocations
Suppose that a policyholder faces $n$ risks $X_1, \dots, X_n$ which are insured under the policy limit with the total limit of $l$. Usually, the policyholder is asked to protect each $X_i$ with an arbitrary limit of $l_i$ such that $\sum_{i=1}^{n} l_i = l$. If the risks are independent and identically distributed with log-concave cumulative distribution function, using the notions of majorization and stochastic orderings, ...
A characterization of product-form exchangeable feature probability functions
We characterize the class of exchangeable feature allocations assigning probability $V_{n,k} \prod_{l=1}^{k} W_{m_l} U_{n-m_l}$ to a feature allocation of $n$ individuals, displaying $k$ features with counts $(m_1, \dots, m_k)$ for these features. Each element of this class is parametrized by a countable matrix $V$ and two sequences $U$ and $W$ of non-negative weights. Moreover, a consistency condition is imposed to guarantee that ...
Asymptotic existence of proportionally fair allocations
Fair division has long been an important problem in the economics literature. In this note, we consider the existence of proportionally fair allocations of indivisible goods, i.e., allocations of indivisible goods in which every agent gets at least her proportionally fair share according to her own utility function. We show that when utilities are additive and utilities for individual goods are...
Metaprogramming for the Generation of Nonparametric Curves
One of the most important functions of paintboxes is drawing curves. These primitives have been programmed and the user can never add a new program which computes the discrete points of a given function. Using metaprogramming and Jordan's method, our program CAPC automatically generates, for a given function, a new program which computes the discrete points for this function and adds it to ...
A Self-organized Multi Agent Decision Making System Based on Fuzzy Probabilities: The Case of Aphasia Diagnosis
Aphasia diagnosis is a challenging medical diagnostic task due to linguistic uncertainty and vagueness, a large number of imprecise measurements, inconsistencies in the definition of aphasic syndromes, and natural diversity and subjectivity in test objects as well as in the opinions of experts who diagnose the disease. In this paper we present a new self-organized multi agent system that diagno...
Publication date: 2013